Statistical strategies for constructing health risk models with multiple pollutants and their interactions: possible choices and comparisons
نویسندگان
چکیده
BACKGROUND As public awareness of consequences of environmental exposures has grown, estimating the adverse health effects due to simultaneous exposure to multiple pollutants is an important topic to explore. The challenges of evaluating the health impacts of environmental factors in a multipollutant model include, but are not limited to: identification of the most critical components of the pollutant mixture, examination of potential interaction effects, and attribution of health effects to individual pollutants in the presence of multicollinearity. METHODS In this paper, we reviewed five methods available in the statistical literature that are potentially helpful for constructing multipollutant models. We conducted a simulation study and presented two data examples to assess the performance of these methods on feature selection, effect estimation and interaction identification using both cross-sectional and time-series designs. We also proposed and evaluated a two-step strategy employing an initial screening by a tree-based method followed by further dimension reduction/variable selection by the aforementioned five approaches at the second step. RESULTS Among the five methods, least absolute shrinkage and selection operator regression performs well in general for identifying important exposures, but will yield biased estimates and slightly larger model dimension given many correlated candidate exposures and modest sample size. Bayesian model averaging, and supervised principal component analysis are also useful in variable selection when there is a moderately strong exposure-response association. Substantial improvements on reducing model dimension and identifying important variables have been observed for all the five statistical methods using the two-step modeling strategy when the number of candidate variables is large. CONCLUSIONS There is no uniform dominance of one method across all simulation scenarios and all criteria. The performances differ according to the nature of the response variable, the sample size, the number of pollutants involved, and the strength of exposure-response association/interaction. However, the two-step modeling strategy proposed here is potentially applicable under a multipollutant framework with many covariates by taking advantage of both the screening feature of an initial tree-based method and dimension reduction/variable selection property of the subsequent method. The choice of the method should also depend on the goal of the study: risk prediction, effect estimation or screening for important predictors and their interactions.
منابع مشابه
Correlation of air pollutants with land use and traffic measures in Tehran, Iran: A preliminary statistical analysis for land use regression modeling
Land use regression (LUR) models have been globally used to estimate long-term air pollution exposures. The present study aimed to analyze the association of different land use types and traffic measures with air pollutants in Tehran, Iran, as part of the future development of LUR models. Data of the particulate matter (PM10), sulfur dioxide (SO2), and nitrogen dioxide (NO2) were extracted from...
متن کاملRelationship Between Iranian L2 Learners’ Multiple Intelligences and Language Learning Strategies
L2 learners’ multiple intelligences (MI) profile plays a central role in theirperformance on different aspects of language learning, one of which is the use oflanguage learning strategies (LLSs). Gaining insights into the relationship betweenMI and LLSs makes L2 teachers better understand their learners’ strengths andweaknesses in the use of such strategies and lets them guide the learners bett...
متن کاملEstimation of target hazard quotients for metals by consumption of fish in the North Coast of the Persian Gulf, Iran
In the residential area of the North Coast of the Persian Gulf, consumption of fish is a possible source of exposure to heavy metals and other pollutants, all of which may act as potential risk factors for serious syndromes and fatal diseases. Health risks associated with Pb, Cd, and Hg were assessed based on the target hazard quotients (THQ), which can be derived from concentrations of heavy m...
متن کاملAssessment of the Relation between Students' Gender and Their Scores on Selecting Confidence Choices in Confidence-Based Exams
Introduction: There are various ways such as confidence-based exams to eliminate lucky guesses on a multiple choice question test. In this study the relation between students’ gender and their score on selecting confidence choices in confidence based exams was assessed. Methods: This was a descriptive retrospective study. It was done on all of the medical students taking Biochemistry course du...
متن کاملAn Ecological Study of the Association between Opiate Use and Incidence of Cancers
Background: Cancer is the second leading cause of death after cardiovascular disease. In recent years it has been hypothesized that opiate use could be a risk factor for cancer. This study aimed to evaluate a possible association between opiate use and common cancers using ecological statistics from around the world.Methods: To investigate the association we used ordinary linear regression mode...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 12 شماره
صفحات -
تاریخ انتشار 2013